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ABSTRACT 


Data Deduplication technique is used to protect the confidentiality of 
sensitive data while supporting deduplication, the convergent 
encryption technique has been proposed to encrypt the data before 
outsourcing for better data security, this approach makes the first 
the problem of authorized : 
datadeduplication. Different from traditional deduplication systems, | Er 
the differential privileges of users are further considered in duplicate 
check besides the data itself. Several new deduplication constructions 
are proposed supporting authorized duplicate check in a hybrid cloud 
architecture. Security analysis demonstrates that this scheme is secure 
in terms of the definitions specified in the proposedsecurity model. 


attempt to formally address 
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1. INTRODUCTION 

Data deduplication is a specialized data compression 
technique for eliminating duplicate copies of 
repeating data in storage. The technique is used to 
improve storage utilization and can also be applied to 
network data transfers to reduce the number of bytes 
that must be sent. Instead of keeping multiple data 
copies with the same content, deduplication 
eliminates redundant data by keeping only one 
physical copy and referring other redundant data to 
that copy. Deduplication can take place at either the 
file level or the block level. For file level 
deduplication, it eliminates duplicate copies of the 
same file. Deduplication can also take place at the 
block level, which eliminates duplicate blocks of data 
that occur in non-identical files. 


2. LITERATURE SERVEY 

“Survey on Authorized Data Deduplication 
System using Cryptographic and Access Control 
Techniques” Santoshi S Patil, Samprati T, Asst. 
Prof. Swetha K S,published in 2014 


Ever increasing volume of back up data in cloud 
storage may be a vital challenge back up windows are 
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shrinking due to growth of information. We use the 
concept of deduplicate. Deduplication means 
duplicate data is eliminated a pointer is created to 
reference a data that is backed up. Deduplication can 
take place at file level, in this it detects redundant 
data within and across files or at the block level, in 
this it removes redundant copies of identical files 


3. EXISTINGSYSTEM 

Traditional encryption, while providing data 
confidentiality, is incompatible with data 
deduplication. Specifically, traditional encryption 
requires different users to encrypt their data with their 
own keys. Convergent encryption has been proposed 
to enforce data confidentiality while making 
deduplication feasible. It encrypts/decrypts a data 
copy with a convergent key, which is obtained by 
computing the cryptographic hash value of the 
content of the data copy. After key generation and 
data encryption, users retain the keys and send the 
cipher text to the cloud. Identical data copies will 
generate the same convergent key and hence the same 
cipher text. 
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3.1. Disadvantages of the existing system 

> Existing deduplication systems cannot support 
differential authorization duplicate check, which 
is important in many applications. 


> Traditional deduplication systems based on 
convergent encryption, although providing 
confidentiality to some extent, do not support the 
duplicate check with differential privileges. 


4. PROPOSEDSYSTEM 

Aiming at efficiently solving the problem of 
deduplication with differential privileges in cloud 
computing, a hybrid cloud architecture is considered 
consisting of a public cloud and a private cloud. 
Private cloud is involved as a proxy to allow data 
owner/users to securely perform duplicate check with 
differential privileges. Such an architecture is 
practical and has attracted much attention from 
researchers. The data owners only outsource their 
data storage by utilizing public cloud while the data 
operation is managed in private cloud. A new 
deduplication system supporting differential duplicate 
check is proposed under this hybrid cloud architecture 
where the S-CSP resides in the public cloud. The user 
is only allowed to perform the duplicate check for 
files marked with the corresponding privileges. 
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5. CONCLUSION 

In this project, the notion of authorized data 
deduplication was proposed to protect the data 
security by including differential privileges of users 
in the duplicate check. We also presented several new 
deduplication constructions supporting authorized 
duplicate check in hybrid cloud architecture, in which 
the duplicate-check tokens of files are generated by 
the private cloud server with private keys. Security 
analysis demonstrates that our schemes are secure in 
terms of insider and outsider attacks specified in the 
proposed security model. As a proof of concept, we 
implemented a prototype of our proposed authorized 
duplicate check scheme and con-duct testbed 
experiments on our prototype. 
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